Distributed n-player approachability via time and space average consensus

نویسندگان

  • Dario Bauso
  • Giuseppe Notarstefano
چکیده

In this paper we consider repeated coalitional games with transferable utilities (TU) over networks. Namely, we consider a set of n players that have to distribute among themselves a vector of rewards (one for each player). In our network version there is no coordinator allocating the rewards, but the agents have to agree on a common time-averaged vector by updating the local estimates of the reward vector. The common time-averaged reward vector has to approach a suitable constraint set, called core of the game, that guarantees that no agents benefit from quitting the grand coalition. We propose a doubly (over time and space) averaging distributed algorithm. At every iteration, each agent first computes a weighted average of its own timeaveraged estimate and those of his neighbors and then generates a new reward vector in order to drive the time-averaged estimate towards a pre-assigned set. The main contribution of the paper is to prove that under certain assumptions, i) all agents’ estimates reach consensus on the true time-averaged reward vector, and ii) the estimates (and thus the true time-averaged reward vector) approach the pre-assigned set. Conditions for this to happen are related to the connectivity over time of the communication topology and to the approachability principle.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Approachability in repeated games: Computational aspects and a Stackelberg variant

We consider a finite two-player zero-sum game with vector-valued rewards. We study the question of whether a given polyhedral set D is “approachable,” that is, whether Player 1 (the “decision maker”) can guarantee that the long-term average reward belongs to D, for any strategy of Player 2 (the “adversary”). We examine Blackwell’s necessary and sufficient conditions for approachability, and sho...

متن کامل

Approachability in Repeated Games: Computational Aspects and a Stackelberg Variant1

We consider a finite two-player zero-sum game with vector-valued rewards. We study the question of whether a given polyhedral set D is “approachable,” that is, whether Player 1 (the “decision maker”) can guarantee that the long-term average reward belongs to D, for any strategy of Player 2 (the “adversary”). We examine Blackwell’s necessary and sufficient conditions for approachability [3], and...

متن کامل

Approachability in Population Games∗

This paper reframes approachability theory within the context of population games. Thus, whilst a player still aims at driving her average payoff to a predefined set, her opponent is no longer malevolent but rather extracted randomly at each instant of time from a population of individuals choosing actions in a similar manner. First, convergence conditions are revisited based on the common prio...

متن کامل

Approachability, fast and slow

Approachability has become a central tool in the analysis of repeated games and online learning. A player plays a repeated vector-valued game against Nature and her objective is to have her long-term average reward inside some target set. The celebrated results of Blackwell provide a 1/ √ n convergence rate of the expected point-to-set distance if this is achievable, i.e., if the set is approac...

متن کامل

Zero - Sum Games with Vector - Valued Payoffs

In this lecture we formulate and prove the celebrated approachability theorem of Blackwell, which extends von Neumann's minimax theorem to zero-sum games with vector-valued payoffs [1]. (The proof here is based on the presentation in [2]; a similar presentation was given by Foster and Vohra [3].) This theorem is powerful in its own right, but also has significant implications for regret minimiz...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012